Conversational Engagement Recognition Using Auditory and Visual Cues

نویسندگان

  • Yuyun Huang
  • Emer Gilmartin
  • Nick Campbell
چکیده

Automatic prediction of engagement in human-human and human-machine dyadic and multiparty interaction scenarios could greatly aid in evaluation of the success of communication. A corpus of eight face-to-face dyadic casual conversations was recorded and used as the basis for an engagement study, which examined the effectiveness of several methods of engagement level recognition. A convolutional neural network based analysis was seen to be the most effective.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Dependency Analysis, Audiovisual Fusion Cues and a Multimodal BLSTM for Conversational Engagement Recognition

Conversational engagement is a multimodal phenomenon and an essential cue to assess both human-human and human-robot communication. Speaker-dependent and speaker-independent scenarios were addressed in our engagement study. Handcrafted audio-visual features were used. Fixed window sizes for feature fusion method were analysed. Novel dynamic window size selection and multimodal bi-directional lo...

متن کامل

Towards Context-Based Visual Feedback Recognition for Embodied Agents

Head pose and gesture offer several key conversational grounding cues and are used extensively in face-to-face interaction among people. We investigate how contextual information can improve visual recognition of feedback gestures during interactions with embodied conversational agents. We present a visual recognition model that integrates cues from the spoken dialogue of an embodied agent with...

متن کامل

Using Eye Movement Analysis to Study Auditory Effects on Visual Memory Recall

Recent studies in affective computing are focused on sensing human cognitive context using biosignals. In this study, electrooculography (EOG) was utilized to investigate memory recall accessibility via eye movement patterns. 12 subjects were participated in our experiment wherein pictures from four categories were presented. Each category contained nine pictures of which three were presented t...

متن کامل

Is it Possible to Evaluate the Contribution of Visual Information to the Process of Speech Comprehension?

We report in this paper the results of a series of comprehension tests run with the aim of investigating the contribution of visual information to the process of comprehension of conversational speech. The methodology we designed was presented in a previous work [1] in which we also showed the results of a pilot test to confirm our original hypothesis that the comprehension of conversational sp...

متن کامل

Auditory and auditory-visual recognition of clear and conversational speech by older adults.

Research has shown that speech articulated in a clear manner is easier to understand than conversationally spoken speech in both the auditory-only (A-only) and auditory-visual (AV) domains. Because this research has been conducted using younger adults, it is unknown whether age-related changes in auditory and/or visual processing affect older adults' ability to benefit when a talker speaks clea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016